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1 Document Databases: Requirements for XML document database systems 
Airi Salminen, Frank Wm. Tompa 

November 2001 Proceedings of the 2001 ACM Symposi um on Document engineering 



Full text available: Qpdf(141.89 KB) 



Additional Information: full citation , abstract , references , citings , index 
terms 



The shift from SGML to XML has created new demands for managing structured documents. 
Many XML documents will be transient representations for the purpose of data exchange 
between different types of applications, but there will also be a need for effective means to 
manage persistent XML data as a database. In this paper we explore requirements for an 
XML database management system. The purpose of the paper is not to suggest a single 
type of system covering all necessary features. Instead the pur ... 

Keywords: XML, XML database systems, data definition, data manipulation, data 
modelling, structured documents 



A fine-grained access control system for XML documents | 
Ernesto Damiani, Sabrina De Capitani di Vimercati, Stefano Paraboschi, Pierangela Samarati 
May 2002 ACM Transactions on Information and System Security (TISSEC), Volume 5 
Issue 2 

» .I . , ■. u. « Jf/<1 on en ixd\ Additional Information: full citation , abstract , references , citings , index 

Full text available: TO pdf(330.60 KB) ± 

i^H^-* terms 

Web-based applications greatly increase information availability and ease of access, which 
is optimal for public information. The distribution and sharing of information via the Web 
that must be accessed in a selective way, such as electronic commerce transactions, require 
the definition and enforcement of security controls, ensuring that information will be 
accessible only to authorized entities. Different approaches have been proposed that 
address the problem of protecting information in a Web ... 

Keywords: Access control, World Wide Web, XML documents, authorizations specification 
and enforcement 



3 Document querying and transformation: A three-way merge for XML documents 
Tancred Lindholm 

October 2004 Proceedings of the 2004 ACM symposium on Document engineering 

Full text available: ^ pdf(500.99 KB) Additional Information: full citation , abstract , references , index terms 

Three-way merging is a technique that may be employed for reintegrating changes to a 
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document in cases where multiple independently modified copies have been made. While 
tools for three-way merge of ASCII text files exist in the form of the ubiquitous diff and 
patch tools these are of limited applicability to XML documents. 

We present a method for three-way merging of XML which is targeted at merging XML 
formats that model human-authored documents as ordered trees (e.g. rich text forma ... 

Keywords: XML, collaborative editing, conflict, structured text, three-way merge 



4 XML access control: Access control of XML documents considering update operations j| 
Chung-Hwan Lim, Seog Park, Sang H. Son 

October 2003 Proceedings of the 2003 ACM workshop on XML security 

Full text available: g pdf(298.78 KB) Additional Information: full citation , abstract , references , index terms 

As a large quantity of information is presented in XML format on the Web, there are 
increasing demands for XML security. Until now, research on XML security has been focused 
on the security of data communication using digital signatures or encryption technologies. 
As XML is also used for a data representation of data storage, XML security comes to 
involve not only communication security but also managerial security. Managerial security is 
guaranteed through access control, but existing XML acces ... 

Keywords: XML document, XML update, access control 



5 Model-driven development of Web applications: the AutoWeb system 
Piero Fraternali, Paolo Paolini 

October 2000 ACM Transactions on Information Systems (TOIS), volume 18 issue 4 

r- ., * ^ , ci , //>Aj( MD , Additional Information: full citation , abstract, references , citings, index 
Full text available: ^Tj pdf(6.94 MB) terms 

This paper describes a methodology for the development of WWW applications and a tool 
environment specifically tailored for the methodology. The methodology and the 
development environment are based upon models and techniques already used in the 
hypermedia, information systems, and software engineering fields, adapted and blended in 
an original mix. The foundation of the proposal is the conceptual design of WWW 
applications, using HDM-lite, a notation for the specification of structure, nav ... 

Keywords: HTML, WWW, application, development, intranet, modeling 



6 An analysis of XML database solutions for the management of MPEG-7 media 
descriptions 

Utz Westermann, Wolfgang Klas 

December 2003 ACM Computing Surveys (CSUR), Volume 35 issue 4 

Full text available: ^ pdf(448.76 KB) Additional Information: full citation , abstract , references , index terms 

MPEG-7 constitutes a promising standard for the description of multimedia content. It can 
be expected that a lot of applications based on MPEG-7 media descriptions will be set up in 
the near future. Therefore, means for the adequate management of large amounts of 
MPEG-7-compliant media descriptions are certainly desirable. Essentially, MPEG-7 media 
descriptions are XML documents following media description schemes defined with a variant 
of XML Schema. Thus, it is reasonable to investigate curren ... 

Keywords: MPEG-7, XML database systems, multimedia databases 



7 Information delivery systems: an exploration of Web pull and push technologies ■ 
Julie E. Kendall, Kenneth E. Kendall 
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Full text available: ^ pdf(658.33 KB) Additional Information: full citation , references , citings, index terms 



8 Document creation II: Techniques for authoring complex XML documents B 
Vincent Quint, Irone Vatton 

October 2004 Proceedings of the 2004 ACM symposium on Document engineering 

Full text available: ^ pdtt265.88 KB) Additional Information: full citation , abstract , references , index terms 

This paper reviews the main innovations of XML and considers their impact on the editing 
techniques for structured documents. Namespaces open the way to compound documents; 
well-formedness brings more freedom in the editing task; CSS allows style to be associated 
easily with structured documents. In addition to these innovative features the wide 
deployment of XML introduces structured documents in many new applications including 
applications where text is not the dominant content type. In Ian ... 

Keywords: CSS, XML, authoring tools, compound documents, direct manipulation, 
structured editing, style languages 



A semantic network-based design methodology for XML documents | 
Ling Feng, Elizabeth Chang, Tharam Dillon 

October 2002 ACM Transactions on Information Systems (TOIS), Volume 20 issue 4 

_ ii , , ., ui « , W oor *a i^m Additional Information: full citation , abstract , references , citings , index 
Full text available: ^ pdf(285.64 KB) terms 

The extensible Markup Language (XML) is fast emerging as the dominant standard for 
describing and interchanging data among various systems and databases on the Internet. It 
offers the Document Type Definition (DTD) as a formalism for defining the syntax and 
structure of XML documents. The XML Schema definition language, as a replacement for the 
DTD, provides more rich facilities for defining and constraining the content of XML 
documents. However, it does not concentrate on the semantics that und ... 

Keywords: XML, XML Schema, conceptual modeling, design methodology, semantic 
network 



10 XRel: a path-based approach to storage and retrieval of XML documents using 
relational databases 

August 2001 ACM Transactions on Internet Technology (TOIT), volume l issue l 

r- „ * ^ u. ft orr ./ox Additional Information: full citation , abstract , references , citings, index 

Full text available:^ pdf(264.27 KB) : 
L±H ^ terms , review 

This article describes XRel, a novel approach for storage and retrieval of XML documents 
using relational databases. In this approach, an XML document is decomposed into nodes 
on the basis of its tree structure and stored in relational tables according to the node type, 
with path information from the root to each node. XRel enables us to store XML documents 
using a fixed relational schema without any information about DTDs and also to utilize 
indices such as the B+ 

Keywords: XML query, XPath, text markup, text tagging 



11 Implementing catalog clearinghouses with XML and XSL 
Andrew V. Royappa 

February 1999 Proceedings of the 1999 ACM symposium on Applied computing 

Full text available: Q pdf(753.90 KB) Additional Information: full citation , references , citings , index terms 
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Keywords: SGML, XML, XSL, e-commerce 



12 Dynamic views of SGML tagged documents I 
B. Fraser, J. Roberts, G. Pianosi, P. Alencar, D. Cowan, D. German, L. Nova 
October 1999 Proceedings of the 17th annual international conference on Computer 
documentation 

r- .. * ^ , u en mo™ 4nvB\ Additional Information: full citation , abstract, references , citings, index 

Full text available: fju pdf(818.10 KB) 

terms 

Product information is more frequently being delivered as hypertext webs or documents 
because of the availability of the World-Wide Web and the associated communications 
infrastructure. However, this type of document with its large number of files and hyperlinks 
can become very complex and present significant usability problems for the creator, 
maintainer and user. Because of this complexity it becomes extremely difficult to implement 
and maintain dynamic views of a document, a supposed adv ... 

Keywords: SGML, SQL, World-Wide Web, XML, documentation, dynamic views, hyperlinks, 
relational databases, tagging languages, usability 



13 Streams, structures, spaces, scenarios, societies (5s): A formal model for digital 
libraries 

Marcos Andre Gongalves, Edward A. Fox, Layne T. Watson, Neill A. Kipp 

April 2004 ACM Transactions on Information Systems (TOIS), Volume 22 issue 2 

<- ... ^ u. 0 ^ eflE i/m Additional Information: full citation , abstract , references , citings , index 

Full text available: TOpdf(316.85 KB) 

^ terms 

Digital libraries (DLs) are complex information systems and therefore demand formal 
foundations lest development efforts diverge and interoperability suffers. In this article, we 
propose the fundamental abstractions of Streams, Structures, Spaces, Scenarios, and 
Societies (5S), which allow us to define digital libraries rigorously and usefully. Streams are 
sequences of arbitrary items used to describe both static and dynamic (e.g., video) content. 
Structures can be viewed as labeled directed gra ... 

Keywords: applications., definitions, foundations, taxonomy 



14 XML access control: A bitmap-based access control for restricted views of XML 
documents 

Abhilash Gummadi, Jong P. Yoon, Biren Shah, Vijay Raghavan 

October 2003 Proceedings of the 2003 ACM workshop on XML security 

Full text available: ^ pdf(268.58 KB) Additional Information: full citation , abstract , references , index terms 

The information on the web is growing at a very fast pace. In this ever-accumulating data, 
the volume of information represented in XML format is on the rise in recent times. An 
organization that puts forth its information on the web in XML format has several issues to 
take into account such as limiting the view of intended audience to only relevant portions of 
the documents. To address this problem, we propose the concept of "Restricted views" to 
implement security in XML documents. This could ... 

Keywords: XML, access control, bitmap, restricted views, security, security cube 



15 Interactive mathematics via the Web using MathML 
Francis J. Wright 

June 2000 ACM SIGSAM Bulletin, Volume 34 Issue 2 

Full text available: ^ pdf(1.07 MB) Additional Information: full citation , abstract , index terms 
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MathML is a mathematical markup language intended for displaying mathematics in web 
browsers. At present, it can be used to display mathematics generated dynamically in 
response to interactive queries only if the browsing and generating facilities are chosen 
carefully. This paper examines the background and possible options, and describes some of 
the details of the use of MathML to display the output from a web-based demonstration of 
an ordinary differential equation solver running in REDUCE ... 

16 Managing the software design documents with XML 
Junichi Suzuki, Yoshikazu Yamamoto 

September 1998 Proceedings of the 16th annual international conference on Computer 
documentation 

. Full text available: ^ pdf(1.09 MB) Additional Information: full citation , references , index terms 



Keywords: CASE data interchange, UML, XML, software model interchange 



17 An intelligent distributed environment for active learning 

August 2001 Journal on Educational Resources in Computing (JERIC) 

Full text available: flBpdffl 16.71 KB) Additional Information: full citation , abstract, references , citings, index 
fe^r*" terms , review 

Active learning is an effective learning approach. In this article we present an intelligent 
agent-assisted environment for active learning to better support the student-centered, 
selfpaced, and highly interactive learning approach. The environment uses the students 
learningrelated profile such as learning style and background knowledge in selecting, 
organizing, and presenting learning material, and it adopts a new approach to course 
content organization and delivery based on smart instruct ... 

Keywords: XML, active learning, multiagent system, web-based education 



18 Contributed articles: Resource description framework: metadata and its applications 
K. Selguk Candan, Huan Liu, Reshma Suvarna 

July 2001 ACM SIGKDD Explorations Newsletter, Volume 3 issue i 
Full text available: ^ pdf(1.02 MB) Additional Information: full citation , abstract , references , citings 

Universality, the property of the Web that makes it the largest data and information source 
in the world, is also the property behind the lack of a uniform organization scheme that 
would allow easy access to data and information. A semantic web, wherein different 
applications and Web sites can exchange information and hence exploit Web data and 
information to their full potential, requires the information about Web resources to be 
represented in a detailed and structured manner. Resource Descrip ... 

Keywords: Resource Description Framework (RDF), Web, XML, metadata, semantic web 




19 Technology to enable learning I: SVG for educational simulations 
Daniel S. Bogaard, Ronald P. Vullo, Christopher D. Cascioli 

October 2004 Proceedings of the 5th conference on Information technology education 

Full text available: ^ pdf(234.58 KB) Additional Information: full citation , abstract , references , index terms 

Helping students to understand complex ideas will always be problematic for teaching 
professionals. Often, the students can be limited by not only their imagination, but by their 
experiences. When trying to explain something that is outside of the students' imagination, 
it is often helpful to have either simple animations or even interactive simulations that the 
students can explore. The creation of interactive environments and the use of animation 
can greatly help educators get their point a ... 
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Keywords: animation, instructional software, scalable vector graphics, simulations 

20 An intelligent distributed environment for active learning 
Yi Shang, Hongchi Shi, Su-Shing Chen 

April 2001 Proceedings of the tenth international conference on World Wide Web 

Full text available: ^ pdf(200.31 KB) Additional Information: full citation , references , citings , index terms 

Keywords: XML, active learning, multi-agent system, web-based education 
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21 Document querying and transformation: Lazv XSL transformations 
Steffen Schott, Markus L. Noga 

November 2003 Proceedings of the 2003 ACM symposium on Document engineering 

Full text available: ^ pdf(335.83 KB) Additional Information: full citation , abstract , references , index terms 

We introduce a lazy XSLT interpreter that provides random access to the transformation 
result. This allows efficient pipelining of transformation sequences. Nodes of the result tree 
are computed only upon initial access. As these computations have limited fan-in, sparse 
output coverage propagates backwards through the pipeline. In comparative measurements 
with traditional eager implementations, our approach is on par for complete coverage and 
excels as coverage becomes sparser. In contrast to eag ... 

22 Declarative specification of Web sites with S 

Mary Fernandez, Daniela Florescu, Alon Levy, Dan Suciu 

March 2000 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 9 Issue 1 

Full text available: fjp pdf(1 88.65 KB) Additional Information: full citation , abstract , citings, index terms 

S is a system for implementing data -intensive Web sites, which typically integrate 
information from multiple data sources and have complex structure. S's key idea is 
separating the management of a Web site's data, the specification of its content and 
structure, and the visual representation of its pages. S provides a declarative query 
language for specifying a site's content and structure, and a simple template language for 
specifying a site's HTML representation. This paper ... 



Keywords: Declarative query languages, Web-site management 



23 STOP: light on the history of outlining B 
Jonathan Price 

August 1999 ACM SIGDOC Asterisk Journal of Computer Documentation, Volume 23 issue 3 
Full text available:^ pdf(937.39 KB) Additional Information: full citation , index terms 



24 Managing change on the web 

Luis Francisco-Revilla, Frank Shipman, Richard Furuta, Unmil Karadkar, Avitai Arora 

January 2001 Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries 

F lit xt v ilabl df(274 89KB) Additional Information: full citation , abstract , references , citings , index 
u e aval a e.-[^£_x — = 1 terms 
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Increasingly, digital libraries are being defined that collect pointers to World-Wide Web 
based resources rather than hold the resources themselves. Maintaining these collections is 
challenging due to distributed document ownership and high fluidity. Typically a collections 
maintainer has to assess the relevance of changes with little system aid. In this paper, we 
describe the Waldens Paths Path Manager, which assists a maintainer in discovering when 
relevant changes occur to linked resour ... 



Keywords: Walden's path, path maintenance 



25 XML indexing and compression: D(k)-index: an adaptive structural summary for graph- 
structured data 

Qun Chen, Andrew Lim, Kian Win Ong 

June 2003 Proceedings of the 2003 ACM SIGMOD i nternational conference on 
Management of data 

Full text available: ^|pdf(217.30 KB) Additional Information: full citation , abstract , references , index terms 

To facilitate queries over semi-structured data, various structural summaries have been 
proposed. Structural summaries are derived directly from the data and serve as indices for 
evaluating path expressions on semi-structured or XML data. We introduce the D(k) index, 
an adaptive structural summary for general graph structured documents. Building on 
previous work, 1-index and A(k) index, the D(k)-index is also based on the concept of 
bisimilarity. However, as a generalization of the 1-index and A ... 

26 Evaluating the reverse engineering capabilities of Web tools for understanding site 

content and structure: a case study 
Scott Tilley, Shihong Huang 

July 2001 Proceedings of the 23rd International Conference on Software Engineering 

Full text available: J| pdf(360.47 KB) Additional Information: full citation , abstract , references , citings , index 
f P Publisher Site 

This paper describes an evaluation of the reverse engineering capabilities of three Web 
tools for understanding site content and structure. The evaluation is based on partitioning 
Web sites into three classes (static, interactive, and dynamic), and is structured using an 
existing reverse engineering environment framework (REEF). This case study also 
represents an initial evaluation of the applicability of the REEF in the related but 
qualitatively different domain of Web sites. The case stu ... 

27 Re-engineering structures from Web documents 
Chuang-Hue Moh, Ee-Peng Lim, Wee-Keong Ng 

June 2000 Proceedings of the fifth ACM conference on Digital libraries 

Full text avallable:1 BDdff180.9S KB) Additional Information: full citation , abstract, references , citings, index 
^ terms 

To realize a wide range of applications (including digital libraries) on the Web, a more 
structured way of accessing the Web is required and such requirement can be facilitated by 
the use of XML standard. In this paper, we propose a general framework for reverse 
engineering (or re-engineering) the underlying structures i.e., the DTD from a collection of 
similarly structured XML documents when they share some common but unknown DTDs. 
The essential data structures and algorithms for ... 



Keywords: Web information discovery, XML 



28 Research sessions: path indexing: APEX: an adaptive path index for XML data 
Chin-Wan Chung, Jun-Ki Min, Kyuseok Shim 

June 2002 Proceedings of the 2002 ACM SIGMOD i nternational conference on 
Management of data 
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Full text available: ^| pdfH.16 MB) Additional Information: full citation , abstract , references , citings , index 

terms 

The emergence of the Web has increased interests in XML data. XML query languages such 
as XQuery and XPath use label paths to traverse the irregularly structured data. Without a 
structural summary and efficient indexes, query processing can be quite inefficient due to 
an exhaustive traversal on XML data. To overcome the inefficiency, several path indexes 
have been proposed in the research community. Traditional indexes generally record all 
label paths from the root element in XML data. Such path ... 

29 Preserving mapping consistency under schema changes B 
Yannis Velegrakis, J. Miller, Lucian Popa 

September 2004 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 13 Issue 3 
Full text available: *Q pdf(327.88 KB) Additional Information: full citation , abstract 

In dynamic environments like the Web, data sources may change not only their data but 
also their schemas, their semantics, and their query capabilities. When a mapping is left 
inconsistent by a schema change, it has to be detected and updated. We present a novel 
framework and a tool (ToMAS) for automatically adapting (rewriting) mappings as schemas 
evolve. Our approach considers not only local changes to a schema but also changes that 
may affect and transform many components of a schema. Our alg ... 

30 User interfaces and services: A web interface to image-based concurrent markup using J 
image maps 

Jerzy W. Jaromczyk, Miroslaw Kowaluk, Neil Moore 

November 2004 Proceedings of the 6th annual ACM international workshop on Web 
information and data management 

Full text available: g pdf(322.28 KB) Additional Information: full citation , abstract , references , index terms 

Image-based electronic editions (IBEEs) encode manuscripts using markup based on 
digitized images of the manuscript page. Due to the complex nature of editorial 
annotations, the resulting markup rarely forms a hierarchical structure— that is, ranges 
often overlap. To support web-based access to these editions, it is desirable to synchronize 
the display of markup and related satellite data with the corresponding portions of the 
image. This goal can be supported by image maps, which associate ... 

Keywords: client-side image maps, computational geometry, concurrent markup, image- 
based electronic editions, voronoi diagram 



31 Secure and selective dissemination of XML documents 
Elisa Bertino, Elena Ferrari 

August 2002 ACM Transactions on Information and System Security (TISSEC), volume 5 
Issue 3 

_ , u. « ^ 700i(m Additional Information: full citation , abstract, references , citings, index 
Full text available: ^ pdf(678.34 KB) g 

XML (extensible Markup Language) has emerged as a prevalent standard for document 
representation and exchange on the Web. It is often the case that XML documents contain 
information of different sensitivity degrees that must be selectively shared by (possibly 
large) user communities. There is thus the need for models and mechanisms enabling the 
specification and enforcement of access control policies for XML documents. Mechanisms 
are also required enabling a secure and selective dissemina ... 

Keywords: Access control, XML, secure distribution 



32 Extending Java for high-level Web service construction 
Aske Simon Christensen, Anders Moller, Michael I. Schwartzbach 
November 2003 
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ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 25 Issue 6 

c >.* ^ •• ui 0i ^ 7 m^ Additional Information: full citation , abstract , references , citings , index 
Full text available: g pdf(947.02 KB) terms 

We incorporate innovations from the <bigwig> project into the Java language to provide 
high-level features for Web service programming. The resulting language, JWIG, contains 
an advanced session model and a flexible mechanism for dynamic construction of XML 
documents, in particular XHTML To support program development we provide a suite of 
program analyses that at compile time verify for a given program that no runtime errors 
can occur while building documents or receiving form input, and ... 

Keywords: Interactive Web services, XML, data-flow analysis 



33 Session 2: secure Web services: Designing a distributed access control processor for Q| 
network services on the Web 
Reiner Kraft 

November 2002 Proceedings of the 2002 ACM workshop on XML security 

Full text available: ^ pdf(301.14 KB) Additional Information: full citation , abstract , references , index terms 

The service oriented architecture (SOA) is gaining more momentum with the advent of 
network services on the Web. A programmable and machine accessible Web is the vision of 
many,and might represent a step towards the semantic Web. However, security is a crucial 
requirement for the serious usage and adoption of the Web services technology. This paper 
enumerates design goals for an access control model for Web services. It then introduces 
an abstract general model for Web services components, along ... 

Keywords: Web services, XML, access control, security 



34 Consistency and replication: XML three-way merge as a reconciliation engine for 

mobile data 
Tancred Lindholm 

September 2003 Proceedings of the 3rd ACM international workshop on Data 
engineering for wireless and mobile access 

Full text available: gpdf(128.03 KB) Additional Information: full citation , abstract , references , index terms 

Optimistic replication approaches are often employed on mobile devices, which raises the 
need for reconciliation of concurrently modified data. We propose that three-way merging 
algorithms, in particular those that are able to process tree-structured data in XML format, 
make good candidates for a generic data reconciliation engine on mobile devices. By 
exchanging data through XML files we impose minimal constraints on application design and 
are able to offer reconciliation services to a large num ... 

Keywords: XML, optimistic replication, reconciliation, three-way merge 



35 Document reuse and semantics: Towards a semantics for XML markup 
Allen Renear, David Dubin, C. M. Sperberg-McQueen 

November 2002 Proceedings of the 2002 ACM symposium on Document engineering 

Additional Information: full citation , abstract , references , citings , index 



Full text available: r 

^ terms 

Although XML Document Type Definitions provide a mechanism for specifying, in machine- 
readable form, the syntax of an XML markup language, there is no comparable mechanism 
for specifying the semantics of an XML vocabulary. That is, there is no way to characterize 
the meaning of XML markup so that the facts and relationships represented by the 
occurrence of XML constructs can be explicitly, comprehensively, and mechanically 
identified. This has serious practical and theoretical consequence ... 
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Keywords: SGML, XML, knowledge representation, markup, semantics 

36 innovative Document Systems: TabulaMaaica: an integrated approach to manage 

complex tables 
Horst Silberhorn 

November 2001 Proceedings of the 2001 ACM Symposium on Document engineering 

Full text available: *gj pdfd 38.97 KB) Additional Information: full citation , abstract , references , index terms 

Tables are a special part of documents and specific means have been developed to manage 
them. Step by step, the underlying models to edit and format tables have been improved or 
supplemented by new ones. These models led to a wide variety of table formats and 
produced "tabular legacies", making it difficult to edit, use, or modify tables in varying 
formats. It is even more time-consuming to convert them for various media or to unify or 
compare tabular information. Our approach to tackle these pr ... 

Keywords: WYSIWYG editor, separation of structure and presentation, table processing, 
tabular legacies 



37 Component selection and matching for IP-based design 
G. Martin, R. Seepold, T. Zhang, L. Benini, G. De Micheli 

March 2001 Proceedings of the conference on Design, automation and test in Europe 

Full text available: ^ pdf(1 70.22 KB) Additional Information: full citation , references , index terms 



38 Implementing incremental code migration with XML 
Wolfgang Emmerich, Cecilia Mascolo, Anthony Finkelstein 

June 2000 Proceedings of the 22nd international conference on Software engineering 

r- ... * u, « ^o, oc .m Additional Information: full citation , abstract, references , citings, jndex 
Full text available: ^ pdfd 24.85 KB) terms 

We demonstrate how XML and related technologies can be used for code mobility at any 
granularity, thus overcoming the restrictions of existing approaches. By not fixing a 
particular granularity for mobile code, we enable complete programs as well as individual 
lines of code to be sent across the network. We define the concept of incremental code 
mobility as the ability to migrate and add, remove, or replace code fragments (i.e., 
increments) in a remote program. The combination of fine-grain ... 

Keywords: XML technologies, incremental code migration 



39 Hypermedia semantics: Which semantic web? 
Catherine C. Marshall, Frank M. Shipman 

August 2003 Proceedings of the fourteenth ACM conference on Hypertext and 
hypermedia 

r- .. . ^ u. a .x/oon ™ Additional Information: full citation , abstract , references , citings, index 
Full text available: g pdf(329.99 KB) foam 

Through scenarios in the popular press and technical papers in the research literature, the 
promise of the Semantic Web has raised a number of different expectations. These 
expectations can be traced to three different perspectives on the Semantic Web. The 
Semantic Web is portrayed as: (1) a universal library, to be readily accessed and used by 
humans in a variety of information use contexts; (2) the backdrop for the work of 
computational agents completing sophisticated activities on behalf oft ... 

Keywords: digital libraries, hypertext, information systems, knowledge acquisition, 
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40 Structured document storage and refined declarative and navigational access 
mechanisms in HyperStorM 

Kiemens Bohm, Karl Aberer, Erich J. Neuhold, Xiaoya Yang 

November 1997 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 6 Issue 4 
Full text available:^ pdfd 84. 18 KB) Additional Information: full citation , abstract , index terms 

The combination of SGML and database technology allows to refine both declarative and 
navigational access mechanisms for structured document collection: with regard to 
declarative access, the user can formulate complex information needs without knowing a 
query language, the respective document type definition (DTD) or the underlying modelling. 
Navigational access is eased by hyperlink-rendition mechanisms going beyond plain link- 
integrity checking. With our approach, the database-internal repres ... 

Keywords: Document query languages, Navigation, OODBMSs, SGML 
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41 Effective compression for the web: exploiting document linkages 
Raymond Wan, Alistair Moffat 

January 2001 Proceedings of the 12th Australasian conference on Database 
technologies 

Full text available:*^ pdf(844.81 KB) 



Additional Information: full citation , abstract , references , index terms 



1 Publisher Site 



Providing the infrastructure that supports the WorM-Wide Web is expensive. The costs 
incurred in running a web site include those associated with the content being served; 
those associated with the hardware that supports the site; and the network costs incurred 
in transmitting that content to the end consumers. In this work we examine mechanisms 
for compressing web content so as to reduce the third of these three costs, and describe a 
scheme that exploits the known connectivities between web pag ... 

42 Innovative Document Systems: The multivalent browser: a platform for new ideas 
Thomas A. Phelps, Robert Wilensky 

November 2001 Proceedings of the 2001 ACM Symposium on Document engineering 

Additional Information: full citation , abstract , references , citings , index 
terms 



Full text available: gpdff 188.51 KB) 



The Multivalent Browser is built on a architecture that separates functionality from concrete 
document format. Almost all functionality is made available via relatively small modules of 
code called behaviors that programmers can write to extend the core system. Behaviors can 
be as significant and powerful as parser-renderers for scanned paper, HTML, or TeX DVI; as 
fine-grained as hyperlinks, cookies, and the disabling of menu items; and as innovative or 
uncommon as in situ annotatins, "lenses", ... 



Keywords: annotation, architecture, digital, document, multivalent behavior, paper, 
scanned 



43 Efficient filtering of XML documents with XPath expressions Q 
C.-Y. Chan, P. Felber, M. Garofalakis, R. Rastogi 

December 2002 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 11 Issue 4 
Full text available: Q pdf(383.34 KB) Additional Information: full citation , abstract , index terms 

The publish/subscribe paradigm is a popular model for allowing publishers (i.e., data 
generators) to selectively disseminate data to a large number of widely dispersed 
subscribers (i.e., data consumers) who have registered their interest in specific information 
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items. Early publish/subscribe systems have typically relied on simple subscription 
mechanisms, such as keyword or "bag of words" matching, or simple comparison predicates 
on attribute values. The emergence of XML as a standar ... 

Keywords: Data dissemination, Document filtering, Index structure, XML, XPath 

44 Integrating and customizing heterogeneous e-commerce applications Q 
Anat Eyal, Tova Milo 

August 2001 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 10 Issue 1 

Full text available: ^ pdf(286.63 KB) Additional Information: full citation , abstract , index terms 

A broad spectrum of electronic commerce applications is currently available on the Web, 
providing services in almost any area one can think of. As the number and variety of such 
applications grow, more business opportunities emerge for providing new services based on 
the integration and customization of existing applications. (Web shopping malls and support 
for comparative shopping are just a couple of examples.) Unfortunately, the diversity of 
applications in each specific domain and the dispar ... 

Keywords: Application integration, Data integration, Electronic commerce 



45 The other formalization of law: SGML modelling and tagging 
Daniel Poulin, Guy Huard, Alain Lavoie 

June 1997 Proceedings of the sixth international conference on Artificial intelligence 
and law 

Full text available: Qpdf(1.03 MB) Additional Information: full citation , references , citings , index terms 



Keywords: SGML, information searching, intelligent law information systems, law 
information systems 

46 A programmable editor for developing structured documents based on bidirectional Q 
transformations 

Zhenjiang Hu, Shin-Cheng Mu, Masato Takeichi 

August 2004 Proceedings of the 2004 ACM SIGPLAN symposium on Partial evaluation 
and semantics-based program manipulation 

Full text available: *g pdf(397.19 KB) Additional Information: full citation , abstract , references , index terms 

This paper presents a novel editor supporting interactive refinement in the development of 
structured documents. The user performs a sequence of editing operations on the document 
view, and the editor automatically derives an efficient and reliable document source and a 
transformation that produces the document view. The editor is unique in its 
programmability, in the sense that the transformation can be obtained through editing 
operations. The main tricks behind are the utilization of the view- ... 

Keywords: bidirectional transformation, document engineering, editor, functional 
programming, view updating 



47 Image Retrieval from the World Wide Web: Issues. Techniques, and Systems 
M. L. Kherfi, D. Ziou, A. Bernardi 

March 2004 ACM Computing Surveys (CSUR), volume 36 issue l 

Full text available: ^| pdf(294.13 KB) Additional Information: full citation , abstract , references , index terms 

With the explosive growth of the World Wide Web, the public is gaining access to massive 
amounts of information. However, locating needed and relevant information remains a 
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difficult task, whether the information is textual or visual. Text search engines have existed 
for some years now and have achieved a certain degree of success. However, despite the 
large number of images available on the Web, image search engines are still rare. In this 
article, we show that in order to allow people to profi ... 

Keywords: Image-retrieval, World Wide Web, crawling, feature extraction and selection, 
indexing, relevance feedback, search, similarity 



48 Modeling methodology b: XML-based modeling and simulation: using XML for 
simulation modeling 
Paul A. Fishwick 

December 2002 Proceedings of the 34th conference on Winter simulation: exploring 
new frontiers 

Full text available: pdf(203.09 KB) Additional Information: full citation , abstract , references 



XML represents a new way of organizing information and knowledge of the World Wide 
Web, using markup languages. Whereas HTML is used for presentation-specific content, 
XML builds upon its SGML lineage to separate content from presentation, and provide a 
semantic labeling for elements that comprise a document. With XML, the concept of 
"document" is broadened to include an encapsulation of information and knowledge, and 
not only a flat medium. This suggests that XML can be used for model specif ... 

49 P7: Open-source documentation: in search of user-driven, iust-in-time writing 
Erik Berglund, Michael Priestley 

October 2001 Proceedings of the 19th annual international conference on Computer 
documentation 

Additional Information: full citation , abstract , references , citings , index 



Full text available: „^ , 

terms 

Iterative development models allow developers to respond quickly to changing user 
requirements, but place increasing demands on writers who must handle increasing 
amounts of change with ever-decreasing resources. In the software development world, 
one solution to this problem is open-source development: allowing the users to set 
requirements and priorities by actually contributing to the development of the software. 
This results in just-in-time software improvements that are explicitly user-driv ... 

50 XIRQL: An XML query language based on information retrieval concepts 
Norbert Fuhr, Kai GroPjohann 

April 2004 ACM Transactions on Information Systems (TOIS), Volume 22 issue 2 

_ .a . a , ui « , fm , n , ,, m Additional Information: full citation , abstract , references , citings , index 
Full text available: ^ pdf(281 .91 KB) terms 

XIRQL ("circle") is an XML query language that incorporates imprecision and vagueness for 
both structural and content-oriented query conditions. The corresponding uncertainty is 
handled by a consistent probabilistic model. The core features of XIRQL are (1) document 
ranking based on index term weighting, (2) specificity-oriented search for retrieving the 
most relevant parts of documents, (3) datatypes with vague predicates for dealing with 
specific types of content and (4) structural vagueness f ... 

Keywords: Path algebra, XML, XQuery, probabilistic retrieval, ranked retrieval, vague 
predicates 



51 Complete answer aggregates for treelike databases: a novel approach to combine 
querying and navigation 
Holger Meuss, Klaus U. Schulz 

April 2001 ACM Transactions on Information Systems (TOIS), Volume 19 issue 2 

_ ii , . .... Additional Information: full citation , abstract , references , citings , index 
Full text available: 3 
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The use of markup languages like SGML, HTML or XML for encoding the strucutre of 
documents or linguistic data has lead to many databases where entries are adequately 
described as trees. In this context querying formalisms are interesting that offer the 
possiblity to refer both to textual content and logical structure. We consider models where 
the strucutre specified in a query is not only used as a filter, but also for selecting and 
presenting different parts of the data. If answers are formaliz ... 

Keywords: SGML, XML, answer presentation, information retrieval, logic, query languages, 
semistructured data, structured documents, tree databases, tree matching 



52 Archiving scientific data 

Peter Buneman, Sanjeev Khanna, Keishi Tajima, Wang-Chiew Tan 

March 2004 ACM Transactions on Database Systems (TODS), volume 29 issue l 

Full text available: Q pdf(745.61 KB) Additional Information: full citation , abstract , references , index terms 

Archiving is important for scientific data, where it is necessary to record all past versions of 
a database in order to verify findings based upon a specific version. Much scientific data is 
held in a hierachical format and has a key structure that provides a canonical identification 
for each element of the hierarchy. In this article, we exploit these properties to develop an 
archiving technique that is both efficient in its use of space and preserves the continuity of 
elements through versions ... 

Keywords: Keys for XML 



53 Education and training: An XML-based approach to multimedia software engineering Q 
for distance learning 

T. Arndt, S. K. Chang, A. Guercio, P. Maresca 

July 2002 Proceedings of the 14th international conference on Software engineering 
and knowledge engineering 

Full text available: ^ pdf(95.53 KB) Additional Information: full citation , abstract , references 

Multimedia Software Engineering (MSE) is a new frontier for both Software Engineering 
(SE) and Visual Languages (VL). In fact multimedia software engineering can be considered 
as the discipline for systematic specification, design, substitution and verification of visual 
patterns. Visual Languages contribute to MSE such concepts as: Visual notation for software 
specification, design and verification flow charts, ER diagrams, Petri Nets, UML visualization, 
visual programming languages etc. Multim ... 

54 Editing and authoring: A structural adviser for the XML document authoring Q 
Boris Chidlovskii 

November 2003 Proceedings of the 2003 ACM symposium on Document engineering 

Full text available: *g pdf(207.56 KB) Additional Information: full citation , abstract , references , index terms 

Since the XML format became a de facto standard for structured documents, the IT 
research and industry have developed a number of XML editors to help users produce 
structured documents in XML format. However, the manual generation of structured 
documents in XML format remains a tedious and time-consuming process because of the 
excessive verbosity and length of XML code. In this paper, we design a structural adviser 
for the XML document authoring. The adviser intervenes at any step of the ... 

Keywords: XML markup, data mining, structural pattern, suggestion 
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56 Rhetoric of present sinale-sourcina methodologies 
Dave Clark 

October 2002 Proceedings of the 20th annual international conference on Computer 
documentation 

Full text available: ^ pdf(1 98.53 KB) Additional Information: full citation , abstract , references , index terms 

In this paper, I detail what Bill Hart-Davidson describes as the "anxiety" that many 
technical communicators have about implementations of single source documentation. 
Specifically, I briefly explore what I see as some of the key potential rhetorical problems 
with single sourcing, in part by drawing on real-world examples gathered from 
conversations with and shadowing of technical communicators in their workplaces. I 
address the following potential objections that we need to handle in pragmatic ... 

Keywords: documentation, single source, text reuse, theory 



57 Path sharing and predicate evaluation for high-performance XML filtering Q 
Yanlei Diao, Mehmet Altinel, Michael J. Franklin, Hao Zhang, Peter Fischer 
December 2003 ACM Transactions on Database Systems (TODS), volume 28 issue 4 

Full text available: ^ pdf(543.40 KB) Additional Information: full citation , abstract , references , index terms 

XML filtering systems aim to provide fast, on-the-fly matching of XML-encoded data to large 
numbers of query specifications containing constraints on both structure and content. It is 
now well accepted that approaches using event-based parsing and Finite State Machines 
(FSMs) can provide the basis for highly scalable structure-oriented XML filtering systems. 
The XFilter system [Altinel and Franklin 2000] was the first published FSM-based XML 
filtering approach. XFilter used a separate FSM per pa ... 

Keywords: Nondeterministic Finite Automaton, XML filtering, content-based matching, 
nested path expressions., path sharing, predicate evaluation, structure matching 



58 Hypertext versioning: The molhado hypertext versioning system 
Tien N. Nguyen, Ethan V. Munson, John T. Boyland 

August 2004 Proceedings of the fifteenth ACM conference on Hypertext & hypermedia 

Full text available: ^pdf(943.36 KB) Additional Information: full citation , abstract , references , index terms 

This paper describes Molhado, a hypertext versioning and software configuration 
management system that is distinguished from previous systems by its flexible product 
versioning and structural configuration management model. The model enables a unified 
versioning framework for atomic and composite software artifacts, and hypermedia 
structures among them in a fine-grained manner at the logical level. Hypermedia structures 
are managed separately from documents 1 contents. Molhado explicitly r ... 

Keywords: hypertext versioning, software configuration management, software 
engineering, version control 
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Victor Vianu 

May 2001 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on 
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Full text available: ^ pdf(282.10 KB) Additional Information: full citation , references , citings , index terms 
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60 An architecture for secure wide-area service discovery 

Todd D. Hodes, Steven E. Czerwinski, Ben Y. Zhao, Anthony D. Joseph, Randy H. Katz 
March 2002 Wireless Networks, Volume 8 issue 2/3 

Full text available: ^ pdf(365.68 KB) Additional Information: full citation , abstract , references , index terms 

The widespread deployment of inexpensive communications technology, computational 
resources in the networking infrastructure, and network-enabled end devices poses an 
interesting problem for end users: how to locate a particular network service or device out 
of hundreds of thousands of accessible services and devices. This paper presents the 
architecture and implementation of a secure wide-area Service Discovery Service (SDS). 
Service providers use the SDS to advertise descriptions of available ... 

Keywords: location services, name lookup, network protocols, service discovery 
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